Gradient-based parameter optimization for systems containing discrete-valued functions

نویسندگان

  • Edward Wilson
  • Stephen M. Rock
چکیده

Gradient-based parameter optimization is commonly used for training neural networks and optimizing the performance of other complex systems that only contain continuously differentiable functions. However, there is a large class of important parameter optimization problems involving systems containing discretevalued functions that do not permit the direct use of gradient-based methods. Examples include optimization of control systems containing discrete-level actuators such as on/off devices, systems with discrete-valued inputs and outputs, discrete-decision-making systems (accept/reject), and neural networks built with signums (also known as hard-limiters or Heaviside step functions) rather than sigmoids. Even if most of the system is continuously differentiable, the presence of one or more discrete-valued functions will not allow gradient-based optimization to be used directly. A new algorithm, ‘noisy backpropagation,’ is developed here, as an extension of backpropagation, which solves this problem and extends gradient-based parameter optimization to permit application to systems containing discrete-valued functions. Moreover, the modification to backpropagation is small, requiring only (1) replacement of the discrete-valued functions with continuously differentiable approximations, and (2) injection of noise into the smooth approximating function on the forward sweep during training. Noise injection is the key to reducing the round-off error created when the discrete-valued functions are replaced after training. This generic approach is applicable whenever gradient-based parameter optimization is used with systems containing discrete-valued functions; it is not limited to training neural networks. The examples in this paper demonstrate the use of noisy backpropagation in training two different multi-layer signum networks and in training a neural network for a control problem involving on-off actuators. This final example includes implementation on a laboratory model of a ‘free-flying space robot’ to validate the realizability and practical utility of the method. Copyright # 2002 John Wiley & Sons, Ltd.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the hybrid conjugate gradient method for solving fuzzy optimization problem

In this paper we consider a constrained optimization problem where the objectives are fuzzy functions (fuzzy-valued functions). Fuzzy constrained Optimization (FO) problem plays an important role in many fields, including mathematics, engineering, statistics and so on. In the other side, in the real situations, it is important to know how may obtain its numerical solution of a given interesting...

متن کامل

SIZING OPTIMIZATION OF TRUSS STRUCTURES WITH NEWTON META-HEURISTIC ALGORITHM

This study is devoted to discrete sizing optimization of truss structures employing an efficient discrete evolutionary meta-heuristic algorithm which uses the Newton gradient-based method as its updating scheme and it is named here as Newton Meta-heuristic Algorithm (NMA). In order to enable the NMA population-based meta-heuristic to effectively explore the discrete design space, a term contain...

متن کامل

A class of multi-agent discrete hybrid non linearizable systems: Optimal controller design based on quasi-Newton algorithm for a class of sign-undefinite hessian cost functions

 In the present paper, a class of hybrid, nonlinear and non linearizable dynamic systems is considered. The noted dynamic system is generalized to a multi-agent configuration. The interaction of agents is presented based on graph theory and finally, an interaction tensor defines the multi-agent system in leader-follower consensus in order to design a desirable controller for the noted system. A...

متن کامل

Two Settings of the Dai-Liao Parameter Based on Modified Secant Equations

Following the setting of the Dai-Liao (DL) parameter in conjugate gradient (CG) methods‎, ‎we introduce two new parameters based on the modified secant equation proposed by Li et al‎. ‎(Comput‎. ‎Optim‎. ‎Appl‎. ‎202:523-539‎, ‎2007) with two approaches‎, ‎which use an extended new conjugacy condition‎. ‎The first is based on a modified descent three-term search direction‎, ‎as the descent Hest...

متن کامل

Some new variants of interval-valued Gronwall type inequalities on time scales

By using an efficient partial order and concept of gH-differentiability oninterval-valued functions, we investigate some new variants of Gronwall typeinequalities on time scales, which provide explicit bounds on unknownfunctions. Our results not only unify and extend some continuousinequalities, but also in discrete case, all are new.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002